Preprocessors for Noisy Speech

نویسنده

  • George Zweig
چکیده

Objectives: The objective of this project is to develop a preprocessor for speech recognition systems operating in noisy environments. The preprocessor, consisting of a nonlinear inhomogeneous transmission line, will be realized in software, although realization in hardware in FYgl should be possible. More specifically we will: 1) Develop a nonlinear transmission line preprocessor that accurately simulates the mechanics of the mammalian inner ear at all sound pressure levels. 2) Preprocess speech with the nonlinear transmission line and show that there is a substantial improvement in the signal to noise ratio. 3) Assess the desirability and feasibility of implimenting either a digital or analog transmission line on a chip and using it as a preprocessor in the CMU, BBN, or MIT DARPA funded speech recognition systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

A Preprocessor for Speech Recognition Systems Operating in Noisy Environments

Objectives: The recognition of speech in noisy environments is critical to certain DoD systems now under development. Current preprocessors for speech recognition systems, such as those based on "linear predictive coding," are linear and therefore not effective in noisy environments. The objective of this project is to develop a nonlinear preprocessor for speech recognition systems that signifi...

متن کامل

Auditory Speech Preprocessors

• Speech preprocessors are important. Small improvements at the beginning of the recognition process can lead to substantial improvements by the end. Resolving acoustic ambiguities decreases the number of possibilities that must resolved by higher level linguistic processing. • The past: Much has been learned about speech preproeessing from the inner ear of vertebrates. Historically, this appro...

متن کامل

Optimized estimation of spectral parameters for the coding of noisy speech

In this contribution we optimize a speech enhancement preprocessor such that a distortion measure in the Line Spectral Frequency (LSF) domain is minimized. We can thus improve the estimation of spectral parameters of a speech coder when the input signal to the coder is a noisy speech signal. The optimization aims at the maximum noise reduction of the enhancement preprocessor. The average maximu...

متن کامل

An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition

Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1989